Ransomware encryption algorithm determination

ABSTRACT

A computer implemented method of identifying an encryption algorithm used by a ransomware algorithm, the ransomware algorithm encrypting a data store of a target computer system using a searchable encryption algorithm, the method including intercepting an ordered plurality of messages communicated from the target computer system to a ransomware server computer system, each message including a payload storing an encrypted unit of data from the target computer system; inspecting a final byte in the encrypted unit of data in each message to identify a byte value used by an encryption algorithm of the ransomware as a padding byte to pad messages to the size of an integral multiple of units of encryption for the encryption algorithm; training an autoencoder based on a position of a message in the ordered plurality of messages and the padding byte to provide a trained autoencoder adapted to differentiate the encryption algorithm used by the ransomware from other different encryption algorithms.

RELATED APPLICATION

The present application claims priority to European Application No.18193911.7 filed Sep. 12, 2018, which is hereby incorporated herein inits entirety by reference.

TECHNICAL FIELD

The present disclosure relates to the categorization of ransomware.

BACKGROUND

A ransomware attack involves an attacking computer system encryptingdata stored at a vulnerable target computer system—such as whole diskencryption—so as to prevent users of the target system from accessingthe data. Targets may be offered access to their data on receipt of apayment.

Accordingly it would be beneficial to mitigate such attacks.

SUMMARY

The present disclosure accordingly provides, in a first aspect, acomputer implemented method of identifying an encryption algorithm usedby a ransomware algorithm, the ransomware algorithm encrypting a datastore of a target computer system using a searchable encryptionalgorithm, the method comprising: intercepting an ordered plurality ofmessages communicated from the target computer system to a ransomwareserver computer system, each message including a payload storing anencrypted unit of data from the target computer system; inspecting afinal byte in the encrypted unit of data in each message to identify abyte value used by an encryption algorithm of the ransomware as apadding byte to pad messages to the size of an integral multiple ofunits of encryption for the encryption algorithm; training anautoencoder based on a position of a message in the ordered plurality ofmessages and the padding byte to provide a trained autoencoder adaptedto differentiate the encryption algorithm used by the ransomware fromother different encryption algorithms.

In one embodiment the method further comprises: for each of a set ofcandidate searchable encryption algorithms: a) encrypting a sample dataset; b) requesting and receiving an ordered plurality of elements of theencrypted data set using locations indicated in an index generated bythe candidate encryption algorithm; c) inspecting a final byte of eachelement; and d) invoking the trained autoencoder based on a position ofeach element in the ordered plurality of elements and a final bytecorresponding to each element so as to determine if the candidatesearchable encryption algorithm matches the encryption algorithm used bythe ransomware.

The present disclosure accordingly provides, in a second aspect, acomputer system including a processor and memory storing computerprogram code for performing the method set out above.

The present disclosure accordingly provides, in a third aspect, acomputer program element comprising computer program code to, whenloaded into a computer system and executed thereon, cause the computerto perform the method set out above.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present disclosure will now be described, by way ofexample only, with reference to the accompanying drawings, in which:

FIG. 1 is a block diagram a computer system suitable for the operationof embodiments of the present disclosure.

FIG. 2 is a component diagram of an arrangement including a ransomwareidentifier according to embodiments of the present disclosure.

FIG. 3 is a flowchart of a method of identifying a ransomware algorithmaccording to embodiments of the present disclosure.

FIG. 4 is a component diagram of an arrangement including an encryptionalgorithm identifier according to embodiments of the present disclosure.

FIG. 5 is a flowchart of a method of identifying an encryption algorithmused by a ransomware algorithm according to embodiments of the presentdisclosure.

FIG. 6 is a component diagram of an arrangement including a monitor fordetermining a plurality of data sources providing seed parameters of anencryption algorithm according to embodiments of the present disclosure.

FIG. 7 is a flowchart of a method for determining a plurality of datasources providing seed parameters of an encryption algorithm accordingto embodiments of the present disclosure.

FIG. 8 is a flowchart of a method for decrypting an encrypted data storeat a target computer system encrypted by a ransomware algorithm inaccordance with embodiments of the present disclosure.

DETAILED DESCRIPTION OF THE DRAWINGS

In a ransomware attack, an attacker may refrain from providing completedecryption in order to pursue an ongoing program of extortion byproviding only partial access to the maliciously encrypted data. Forexample, a victim may be compelled to pay an agent of the attacker toaccess particular data such as data that only exists in the encrypteddisk, data that is rare, data that is valuable, confidential data,personal data and the like. Additionally or alternatively, a ransomwareattacker may seek to benefit from access to data at a target system byunauthorized data access, unauthorized data usage and/or data theft. Forexample, payment information such as credit card details, personalinformation such as name, address and other personal identification orother sensitive information may be stolen by an attacker. To achievesuch targeted data theft, attackers identify such potentially valuableinformation within the data of a target system.

To these ends, attackers employ searchable encryption technologies (asare well known in the art) to selectively decrypt data stored on avictim system. Searchable encryption involves the generation of an indexduring the encryption process to categorize and identify parts of theencrypted data for subsequent selective decryption. For example,sensitive data, financial information, personal confidential informationand the like may be selected for special indexing.

Different ransomware attacks will have different characteristics thatmust be taken into account to inform, inter alia, a nature, order andspeed of defensive and responsive measures that may be taken in aphysical or virtual computer system or network of such computer systemswhen ransomware is detected. For example, a rate of encryption, a natureand rate of propagation of malicious software employed by a ransomwareattacker, a nature, extent and reliability of any paid-for decryption.Such characteristics, and others that will be apparent to those skilledin the art, may impact how an organization should react to a ransomwareattack. Reactive measures can involve: determining an extent ofisolation required for a network of connected systems within anorganization (e.g. is the ransomware likely confined or widely spread ata point in time following detection?); determining an extent of spreadof ransomware (e.g. are network appliances, peripherals and networkstorage implicated?); whether a remediation or mitigation mechanism isknown; whether the attacker is cooperative; and others. Accordingly, itis beneficial to categorize ransomware to determine attributes forinforming and selecting reactive measures.

FIG. 1 is a block diagram of a computer system suitable for theoperation of embodiments of the present disclosure. A central processorunit (CPU) 102 is communicatively connected to a storage 104 and aninput/output (I/O) interface 106 via a data bus 108. The storage 104 canbe any read/write storage device such as a random access memory (RAM) ora non-volatile storage device. An example of a non-volatile storagedevice includes a disk or tape storage device. The I/O interface 106 isan interface to devices for the input or output of data, or for bothinput and output of data. Examples of I/O devices connectable to I/Ointerface 106 include a keyboard, a mouse, a display (such as a monitor)and a network connection.

FIG. 2 is a component diagram of an arrangement including a ransomwareidentifier 218 according to embodiments of the present disclosure. Aserver 202 is a computer system involved in delivering, triggering,prompting or serving a ransomware attack on a target computer system206. For example, the ransomware attack can be effected by deliveringmalicious software (ransomware 204) to the target computer system 206 toencrypt data 208 stored at the target computer system 206. Theransomware 204 employs a searchable encryption (se) algorithm 210 toencrypt the data at the target computer system 206. In doing so, theencryption algorithm 210 generates a searchable encryption index 212that is communicated to the server 202.

Embodiments of the present disclosure exploit the method of operation ofransomware and the mechanism of ransomware attacks to identifyransomware attacks undertaken using an identifiable ransomware algorithmsuch that responsive actions 214 known to be effective, appropriate,occasioned or otherwise warranted in response to a particular ransomware204 can be effected. Thus, a ransomware identifier 216 component is ahardware, software, firmware or combination component communicativelyconnected to the target computer system 206 and a communication meansthrough which the ransomware server 202 communicates therewith, such asa computer network. The ransomware identifier 216 actively exposes thetarget computer system 206 to the ransomware algorithm 204. The data 208stored by target computer system 206 is a predetermined data set suchthat it can be reconstituted, replicated and reused. In one embodiment,the data 208 includes data that may be actively indexed by ransomwaresuch as data of value to a malicious entity including, inter alia:personal sensitive information such as names, addresses, contactinformation; financial information such as bank account information,credit card details, debit card details, online banking credentials andthe like; payment information; data marked confidential; data markedsecret; a private encryption key; a digital signature; usernameinformation; password, passphrase, personal identification number, orother access control credentials; and other data as will be apparent tothose skilled in the art.

During exposure of the target computer system 206 to the ransomware 204the data 208 becomes encrypted by the ransomware 204 using thesearchable encryption algorithm 210, including the generation of theencryption index 212. The ransomware identifier 216 intercepts the index212 which can be provided in plaintext form. Subsequently, theransomware identifier trains an autoencoder 218 based on the index suchthat the autoencoder 218 is trained to recognize the particularransomware 204 based on the index 212 generated by the ransomware 204for data 208. Notably, different ransomware algorithms will cause thegeneration of different indices for a number of reasons including: adifferent emphasis or preference of different ransomware algorithms fordifferent types of data stored in the data set 208, for example, someransomwares will seek to index all personal data while others mightfocus only on credit card numbers and the like; and the differentsearchable encryption algorithms employed by different ransomwares willresult in different indexes.

Thus, the autoencoder 218 can be trained using index data to recognizeindices generated by ransomware 204. One arrangement for generatinginput data for training (or, indeed, testing) the autoencoder 218 isoutlined below.

The index 212 will generally consist of a series of locations within theencrypted form of data 208, each location identifying a particular dataitem or type of data of interest. Such locations will therefore occuracross a range of locations from a lowest location (or offset) in theencrypted data to a highest location (or offset) in the data. In oneembodiment, such an index is converted to a suitable input vector forthe autoencoder 218 as follows:

-   -   1. Normalize each index location in the range [0 . . . 1]. Such        normalization can be achieved by:

$\frac{{{index}\mspace{14mu}{location}} - {{lowest}\mspace{14mu}{location}}}{{{highest}\mspace{14mu}{location}} - {{lowest}\mspace{14mu}{location}}}$

-   -   where: index location is the location (or offset) of a current        index entry being processed; lowest location is the lowest        location (or offset) in the index; and highest location is the        highest location (or offset) indicated in the index.    -   2. All normalized index entries are discretized by association        with slots in a range [0 . . . 1] with the slot size (width)        being determined by:

$\frac{1}{{{highest}\mspace{14mu}{location}} - {{lowest}\mspace{14mu}{location}}}$

-   -   Thus, if locations range from 50 to 150 then the slot size is

$\frac{1}{150 - 50} = \frac{1}{100}$

-   -   and thus slots will occur at [0, 0.01, 0.02, 0.03 . . . ].    -   3. Map each normalized index entries to a slot in the        discretized range of slots. Locating an appropriate slot can use        any suitable and consistent approach such as: rounding down to        the nearest slot; or rounding up to the nearest slot; or        truncating etc.    -   4. A count of entries for each slot can now be generated, and        once counted, each slot assumes a normalized value depending on        the lowest and highest counts for all slots. Thus, each slot        ultimately has a normalized value in the range [0 . . . 1].    -   5. The normalized slot values are used to constitute an input        vector for training the autoencoder.

Once trained, the autoencoder 218 can be further used to determine if asubsequent ransomware matches the one used to train the autoencoder.Thus, responsive to a subsequent ransomware attack using an unknownransomware, the ransomware identifier 216 exposes a computer systemhaving the predetermined set of sample data to the unknown ransomware toeffect encryption of the data by a searchable encryption algorithm ofthe unknown ransomware. Subsequently, an index generated by the unknownransomware can be intercepted and used to generate an input vector forthe trained autoencoder 218 using the steps outlined above. The inputvector so processed is then fed into the autoencoder 218 to determine ifthe autoencoder 218 is able to recognize the input vector as indicativethat the index generated by the unknown ransomware is indicative of theunknown ransomware being the same as the ransomware 204 used to trainthe autoencoder 218. Thus, in this way appropriate responsive actions214 associated with a ransomware 204 can be selected for the unknownransomware as appropriate.

In one embodiment, the autoencoder 218 is trained using multipletraining examples based on indices generated from repeated exposures ofthe target computer system 206 to the ransomware 204. Further, in oneembodiment, the autoencoder 218 is trained using multiple trainingexamples based on indices from a plurality of different ransomwarealgorithms to which the target computer system 206 is exposed todiscriminate ransomware algorithms.

FIG. 3 is a flowchart of a method of identifying a ransomware algorithmaccording to embodiments of the present disclosure. Initially, at 302,the method exposes the target computer system 206 to the ransomware 204.At 304 a searchable encryption index 212 is intercepted and used togenerate training input vector(s) to train the autoencoder 218 at step306. At 308 the method determines if a new ransomware attack isdetected, and if so, the method exposes a computer system with thepredetermined sample data to the ransomware in the attack. At 310 themethod executes the trained autoencoder 218 using an input vectorgenerated from a searchable index of the ransomware used in the attack.At 312 the method determines if the ransomware is recognized by theautoencoder 218 and, if recognized, the method selects and effectsresponsive actions associated with the recognized ransomware at step314.

FIG. 4 is a component diagram of an arrangement including an encryptionalgorithm identifier 422 according to embodiments of the presentdisclosure. Many of the features of FIG. 4 are identical to thosedescribed above with respect to FIG. 2 and these will not be repeatedhere. The encryption algorithm identifier 422 of FIG. 4 is a software,hardware, firmware or combination component arranged to determine whichone of a set of candidate searchable encryption algorithms 430 is usedby the ransomware 204 to encrypt the data 208. This is achieved by theencryption algorithm identifier 422 intercepting an ordered plurality ofmessages 420 communicated from the target computer system 206 to theransomware server 202. Such messages are responses by the ransomwareacting on the target computer system 206 to requests made by the server202 for encrypted data from the data store 208 at locations in the index212. For example, where the server 202 requests to receive encryptedcredit card information stored in the data store 208, the location ofsuch credit card information is determined by the server 202 in theindex 212 and data at that location is requested from the targetcomputer system 206. The messages 420 constitute responses to suchrequests and are ordered temporally according to the requests.

Each message 420 includes a message payload storing an encrypted unit ofdata (data unit) from the target computer system. Different encryptionalgorithms can operate on blocks (or units) of data of different sizes.For example, 64 byte blocks, 128 byte blocks, 256 byte blocks and otherencryption block sizes as will be apparent to those skilled in the art.Accordingly, the data unit in the payload of messages 420 will be anintegral multiple of blocks (units) of encryption for an encryptionalgorithm employed by the ransomware 204. Where the actual datarequested by the server does not constitute such an integral multiple ofencryption blocks, then the data unit in the message payload will bepadded using padding characters (bytes). These padding characters mayvary within the same encryption algorithm across different messages in asequence of messages, though within one message the same character willoccur. Further, across an ordered sequence of messages, commonality canoccur—such as commonality of the sequence of padding charactersemployed.

The encryption algorithm identifier 422 uses these padding characters tocharacterize an encryption algorithm by training an autoencoder 426(notably, a different autoencoder to that described with respect toFIGS. 2 and 3). Initially, a padding byte identifier 424 identifies apadding byte for a message payload as a last byte in the data unit ofthe payload. The last byte is used because, where padding takes place,padding is at the end of the data unit. The autoencoder 426 is thentrained based on the padding byte used by the encryption algorithm ofthe ransomware. The autoencoder 426 is trained using multiple trainingvectors arising from the padding bytes identified in each of an orderedsequence of message payload data units. In this way, the autoencoder 426encodes the characteristics of the padding bytes and the order ofpadding bytes across multiple messages.

The nature of the training vector will now be described according to anexemplary embodiment. The padding byte extracted as the last byte can beassumed to be taken from a subset of all byte values. In someembodiments, all possible values of a character set may be employed, orall values of a byte (0 to 255). By way of example, the 62 byte values[a . . . z], [A . . . Z] and [0 . . . 9] are considered. The byte valuefor a padding byte of a particular message in the ordered sequence ofmessages is combined with the position in the ordered sequence toconstitute an input vector. Thus, the autoencoder 426 in the exemplaryembodiment has input units for each possible byte value for eachpossible sequence value. In a preferred embodiment, the autoencoder 426is a restricted Boltzmann machine having hidden units according to anumber of messages in the ordered sequence of messages, such that eachhidden unit corresponds to a position in the ordered sequence.

Thus, when trained, the autoencoder 426 is adapted to differentiateencryption algorithms used by ransomwares. The identification of aparticular encryption algorithm from the set of candidate algorithms 430can also be achieved using an algorithm matcher 428. The operation ofthe algorithm matcher 428 is outlined below.

The sample data set 432 (corresponding to the data set 208 stored at thetarget computer system) is encrypted by each algorithm in the set ofcandidate searchable algorithms 430, each algorithm also generating asearchable encryption index. Each version of the encrypted sample dataset is then used to request and receive an ordered plurality of elementsof the encrypted data set using locations indicated in a correspondingindex. A final byte of each element is then used, along with a positionin the ordered set of the element, to constitute an input vector for thetrained autoencoder 426. The trained autoencoder 426 is then invokedwith the input vector to determine if the autoencoder 426 recognizes thecandidate searchable encryption algorithm. In this way, a particularencryption algorithm from the candidate set can be associated with theautoencoder 426 trained for a particular ransomware 204, so identifyingthe searchable encryption algorithm for the ransomware.

FIG. 5 is a flowchart of a method of identifying an encryption algorithmused by a ransomware algorithm according to embodiments of the presentdisclosure. Initially, at 502, the method intercepts messages in anordered plurality of messages 420 from the target computer system 206 tothe server 202. At 504 the method inspects a final byte of an encrypteddata unit in a message payload to identify a padding byte. At 506 theautoencoder 426 is trained based on the padding bytes and the positionof each message in the ordered plurality of messages. At 508, for eachsearchable encryption algorithm in the candidate set of algorithms 430,the method performs 510 to 518. At 510 the algorithm matcher 428encrypts the sample data set 432 using a current candidate algorithm. At512 the algorithm matcher 428 requests an ordered plurality of encryptedelements from the data set 432. At 514 the algorithm matcher 428 invokesthe trained autoencoder 426 based on the final (padding) byte of eachelement and the position of each element in the ordered plurality todetermine, at 516, if the autoencoder 426 recognizes the candidateencryption algorithm. Where there is recognition, the candidateencryption algorithm is associated with the ransomware 204 at 520.Otherwise, the flowchart repeats for all candidate algorithms 430 at518.

An encryption algorithm used by a ransomware will require the generationof an encryption key. Ransomware servers may not manage keys for allinfected target computer systems because such management is resourceintensive and introduces a vulnerability of key storage. Accordingly, aransomware will utilize immutable characteristics of a target computersystem to generate a key at the time of ransomware infection in orderthat the same key can be reliably generated by a ransomware server inrespect of the same target computer system subsequently. The key will,thus, be generated based on seed data or parameters arising from thetarget computer system that cannot be expected to change. i.e. datarelating to hardware features of the target computer system such as oneor more of any or all of, inter alia: a central processing unit; amemory; a storage device; a peripheral device; a basic input/outputsubsystem; an output device; an input device; a network device; andother hardware as will be apparent to those skilled in the art. Dataabout such hardware components can include, inter alia: a referencenumber; an identifier; a version; a date; a time; an address; a serialnumber; and/or any unique information about one or more hardwarecomponents as will be apparent to those skilled in the art.

FIG. 6 is a component diagram of an arrangement including a monitor 642for determining a plurality of data sources providing seed parameters ofan encryption algorithm according to embodiments of the presentdisclosure. Many of the features of FIG. 6 are the same as thosedescribed above with respect to FIG. 2 and these will not be repeatedhere. On infection by a ransomware 204, the target computer system 206will be used to generate an encryption key. To access data abouthardware components, devices, features and the like calls will be madeto or via an operating system (OS) 640 of the target computer system.Embodiments of the present invention provide a monitor 642 formonitoring application programming interface (API) calls made to theoperating system 640 to identify a set of one or more calls forretrieving data about one or more hardware components of the targetcomputer system 206. The data about such components is then determinedto constitute the seed parameters for the generation of an encryptionkey by the ransomware 204.

In some embodiments, the timing of the monitoring by the monitor 642 isselected to coincide with a period when generation of the encryption keycan be expected. Thus, the target computer system 206 is exposed to theransomware 204 intentionally and, at the point of initial exposure andbefore encryption commences, monitoring of the API calls is performed.The commencement of encryption can be detected by a sudden increase instorage activity—such as disk input/output activity—arising from theprocess of reading, encrypting and writing data 208 to storagedevice(s).

In one embodiment, the monitor 642 uses a process monitor to identifyAPI calls, such process monitors being commonly available as part of, orto supplement, operating systems of computer systems.

FIG. 7 is a flowchart of a method for determining a plurality of datasources providing seed parameters of an encryption algorithm accordingto embodiments of the present disclosure. At 702 the method exposes thetarget computer system 206 to the ransomware 204. At 704 the monitor 642monitors API calls to or via the operating system 40 to identify callsretrieving (or possibly useful for retrieving) data about hardwarecomponents of the target computer system. At 706 the method determinesdata about hardware retrieved via the API calls detected at 704 toconstitute seed parameters for the generation of an encryption key forthe ransomware 204.

Previously described embodiments serve to identify ransomware, determinea searchable encryption algorithm used by the ransomware and determineseed information for the generation of an encryption key for theransomware. The combination of these techniques can be further appliedto remediate a ransomware infection by decrypting a data store encryptedby a ransomware.

FIG. 8 is a flowchart of a method for decrypting an encrypted data storeat a target computer system encrypted by a ransomware algorithm inaccordance with embodiments of the present disclosure. At 802 asearchable encryption algorithm used by the ransomware is determined.For example, the techniques described above with respect to FIGS. 4 and5 can be employed. At 804, seed parameters used by the encryptionalgorithm for key generation are determined. For example, the techniquesdescribed above with respect to FIGS. 6 and 7 can be employed. Theparticular order of seed parameters used in the key generation processcan be determined by trial and error using, for example, software.Furthermore, the key generation algorithm required can be determinedbased on the identified encryption algorithm from 802. Subsequently, at806, an encryption key for the ransomware infection is generated usingthe seed information determined at 804 and the encryption algorithmdetermined at 802. Finally, at 808, data encrypted by a ransomware isdecrypted using the encryption algorithm determined at 802 and the keygenerated at 808.

Insofar as embodiments of the disclosure described are implementable, atleast in part, using a software-controlled programmable processingdevice, such as a microprocessor, digital signal processor or otherprocessing device, data processing apparatus or system, it will beappreciated that a computer program for configuring a programmabledevice, apparatus or system to implement the foregoing described methodsis envisaged as an aspect of the present disclosure. The computerprogram may be embodied as source code or undergo compilation forimplementation on a processing device, apparatus or system or may beembodied as object code, for example.

Suitably, the computer program is stored on a carrier medium in machineor device readable form, for example in solid-state memory, magneticmemory such as disk or tape, optically or magneto-optically readablememory such as compact disk or digital versatile disk etc., and theprocessing device utilizes the program or a part thereof to configure itfor operation. The computer program may be supplied from a remote sourceembodied in a communications medium such as an electronic signal, radiofrequency carrier wave or optical carrier wave. Such carrier media arealso envisaged as aspects of the present disclosure. It will beunderstood by those skilled in the art that, although the presentdisclosure has been described in relation to the above described exampleembodiments, the invention is not limited thereto and that there aremany possible variations and modifications which fall within the scopeof the invention. The scope of the present invention includes any novelfeatures or combination of features disclosed herein. The applicanthereby gives notice that new claims may be formulated to such featuresor combination of features during prosecution of this application or ofany such further applications derived therefrom. In particular, withreference to the appended claims, features from dependent claims may becombined with those of the independent claims and features fromrespective independent claims may be combined in any appropriate mannerand not merely in the specific combinations enumerated in the claims.

The invention claimed is:
 1. A computer implemented method ofidentifying a ransomware encryption algorithm used by a ransomware, theransomware encrypting a data store of a target computer system using asearchable encryption algorithm, the method comprising: intercepting anordered plurality of messages communicated from the target computersystem to a ransomware server computer system, each message of theordered plurality of messages including a payload storing an encryptedunit of data from the target computer system; inspecting a final byte inthe encrypted unit of data in each message of the ordered plurality ofmessages to identify a byte value used by the ransomware encryptionalgorithm of the ransomware as a padding byte to pad messages to a sizeof an integral multiple of units of encryption for the ransomwareencryption algorithm; and training an autoencoder based on a position ofa message in the ordered plurality of messages and the padding byte toprovide a trained autoencoder adapted to differentiate the ransomwareencryption algorithm used by the ransomware from other differentencryption algorithms.
 2. The method of claim 1, further comprising: foreach of a set of candidate searchable encryption algorithms: encryptinga sample data set; requesting and receiving an ordered plurality ofelements of the encrypted sample data set using locations indicated inan index generated by the candidate searchable encryption algorithm;inspecting a final byte of each element of the ordered plurality ofelements; and invoking the trained autoencoder based on a position ofeach element in the ordered plurality of elements and a final bytecorresponding to each element in the ordered plurality of elements so asto determine if the candidate searchable encryption algorithm matchesthe ransomware encryption algorithm used by the ransomware.
 3. Acomputer system comprising: a processor and memory storing computerprogram code for identifying a ransomware encryption algorithm used by aransomware, the ransomware encrypting a data store of a target computersystem using a searchable encryption algorithm, by: intercepting anordered plurality of messages communicated from the target computersystem to a ransomware server computer system, each message of theordered plurality of messages including a payload storing an encryptedunit of data from the target computer system; inspecting a final byte inthe encrypted unit of data in each message of the ordered plurality ofmessages to identify a byte value used by the ransomware encryptionalgorithm of the ransomware as a padding byte to pad messages to a sizeof an integral multiple of units of encryption for the ransomwareencryption algorithm; and training an autoencoder based on a position ofa message in the ordered plurality of messages and the padding byte toprovide a trained autoencoder adapted to differentiate the ransomwareencryption algorithm used by the ransomware from other differentencryption algorithms.
 4. A non-transitory computer-readable storageelement storing computer program code to, when loaded into a computersystem and executed thereon, cause the computer system to perform amethod of identifying a ransomware encryption algorithm used by aransomware, the ransomware encrypting a data store of a target computersystem using a searchable encryption algorithm, the method comprising:intercepting an ordered plurality of messages communicated from thetarget computer system to a ransomware server computer system, eachmessage of the ordered plurality of messages including a payload storingan encrypted unit of data from the target computer system; inspecting afinal byte in the encrypted unit of data in each message of the orderedplurality of messages to identify a byte value used by the ransomwareencryption algorithm of the ransomware as a padding byte to pad messagesto a size of an integral multiple of units of encryption for theransomware encryption algorithm; and training an autoencoder based on aposition of a message in the ordered plurality of messages and thepadding byte to provide a trained autoencoder adapted to differentiatethe ransomware encryption algorithm used by the ransomware from otherdifferent encryption algorithms.